Multilingual Data-Driven Pronunciation
نویسنده
چکیده
Automatic pronunciation of unknown words is a hard problem of great importance in speech technology. The difficulty of the problem appears to vary across languages, according to the ‘depth’ of orthography, although there are, as yet, little by way of quantitative comparisons. Early solutions were based on manually-written expert rules but these are expensive to derive and maintain, are entirely language-specific and—according to more recent evaluations using large test sets—do not perform very well (at least for English). By contrast, data-driven methods (which infer pronunciations from a set of examples) require little expert knowledge, can be quickly and easily applied to new languages for which an appropriate set of examples is available, and seem to perform far better than rules. In this paper, we compare the success of data-driven automatic pronunciation for four European languages: English, French, Frisian and German. We also attempt to quantify the difficulty of the problem using the entropy of the association between letters and phonemes found after prior alignment of the various dictionaries.
منابع مشابه
Multilingual Pronunciat Improving Multilingual S
Multilinguality aspects are becoming increasingly important in the Automatic Speech Recognition (ASR) systems. It is apparent that coping with large variability of the speech signal is an even bigger challenge in multilingual ASR systems than it has been in conventional monolingual systems. In this paper, we address the importance of combining multilingual pronunciation modeling and acoustic mo...
متن کاملPronunciation and Acoustic Model Adaptation for Improving Multilingual Speech Recognition
In this paper, we address the importance of pronunciation and acoustic model adaptation in multilingual speech recognition. When aiming at modeling several languages simultaneously, the degree of speaker and language variability is even greater than when concentrating on only one language. To compensate the pronunciation variability across various speaker, bi-lingual pronunciation modeling is p...
متن کاملGlobalPhone: Pronunciation Dictionaries in 20 Languages
This paper describes the advances in the multilingual text and speech database GLOBALPHONE a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages. GLOBALPHONE was designed to be uniform across languages with respect to the amount of data, speech quality, the collection scenario, the transcription and phone set convent...
متن کاملTowards multilingual interoperability in automatic speech recognition
In this communication, we address multilingual interoperability aspects in speech recognition. After giving a tentative definition of multilingual interoperability, we discuss speech recognition components and their language-specific aspects. We give a sample overview of past multilingual speech recognition research and development across different speaking styles (read, prepared and conversati...
متن کاملMultilingual phone clustering for recognition of spontaneous indonesian speech utilising pronunciation modelling techniques
In this paper, a multilingual acoustic model set derived from English, Hindi, and Spanish is utilised to recognise speech in Indonesian. In order to achieve this task we incorporate a two tiered approach to perform the cross-lingual porting of the multilingual models to a new language. In the first stage, we use an entropy based decision tree to merge similar phones from different languages int...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009